[PLT-1008] Export content parses incorrectly #1628

adrian-chang · 2024-05-24T00:35:48Z

Introduce Buffered Stream
Prep deprecation of non-buffered methods
Fixup errors / result in export.task to use bufferedstream

libs/labelbox/src/labelbox/schema/export_task.py

vbrodsky

I see the approach and agree with it. The code looks right, not enough deep understanding on my end to dig into each and every line.

Gabefire · 2024-05-26T17:09:29Z

Are you technically deprecating the FileConverterType with this change since the buffered stream does not look like you can pass custom converters?

Gabefire · 2024-05-26T17:30:45Z

NIT: one last thing: would it make sense to change the default value for the convertor in the get_stream method to the buffered version so the deprecation messages do not fire twice/ clean up the method easily? This is for this workflow.

adrian-chang · 2024-05-27T01:34:41Z

Are you technically deprecating the FileConverterType with this change since the buffered stream does not look like you can pass custom converters?

Sort of. I think the way it is written is a bit incorrect (yielding a file constantly is very odd in my view) but the main idea of yielding an entire file is correct.

The Export_Task code is very over engineered and buggy and needs to be simplified.

We should aim to give customers basically 3 outputs: fully outputted in memory, line by line in memory, or an entire file by disk and that's it.

Allowing for a lot of customization such as choosing what chunk to export at (offset) or line makes etc. just makes our life more difficult maintaining the SDK.

adrian-chang · 2024-05-27T01:36:14Z

NIT: one last thing: would it make sense to change the default value for the convertor in the get_stream method to the buffered version so the deprecation messages do not fire twice/ clean up the method easily? This is for this workflow.

No. BufferedJsonOutput has a schema of { json: any } versus { json_str: str }. Changing the convertor is a potential breaking change.

This is also why buffered_stream was introduced.

adrian-chang requested a review from a team as a code owner May 24, 2024 00:35

adrian-chang temporarily deployed to Test-PyPI May 24, 2024 03:04 — with GitHub Actions Inactive

adrian-chang temporarily deployed to Test-PyPI May 24, 2024 03:50 — with GitHub Actions Inactive

vbrodsky reviewed May 24, 2024

View reviewed changes

libs/labelbox/src/labelbox/schema/export_task.py Show resolved Hide resolved

vbrodsky previously approved these changes May 24, 2024

View reviewed changes

adrian-chang changed the title ~~[PLT-0] fix bug meta's~~ [PLT-1008] Export content parses incorrectly May 24, 2024

adrian-chang dismissed vbrodsky’s stale review via 8bfe14c May 25, 2024 07:11

adrian-chang force-pushed the achang/plt-0-fix-bug-meta branch from 28d672c to 8bfe14c Compare May 25, 2024 07:11

Adrian Chang added 6 commits May 25, 2024 22:34

ignore meta file partials

718ac7f

Buffered stream

f9e76b1

Buffered stream code

10cbf66

Fixup unit tests

4ccf197

Add integration test for buffered

40168a4

buffered result

705275b

adrian-chang force-pushed the achang/plt-0-fix-bug-meta branch from d75ce34 to 705275b Compare May 26, 2024 05:34

Gabefire mentioned this pull request May 26, 2024

Export V1 to V2 for integration tests #1618

Merged

adrian-chang requested a review from sfendell-labelbox May 26, 2024 18:59

sfendell-labelbox approved these changes May 26, 2024

View reviewed changes

adrian-chang temporarily deployed to Test-PyPI May 26, 2024 23:56 — with GitHub Actions Inactive

adrian-chang merged commit 80c730c into develop May 27, 2024
22 checks passed

adrian-chang deleted the achang/plt-0-fix-bug-meta branch May 27, 2024 01:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[PLT-1008] Export content parses incorrectly #1628

[PLT-1008] Export content parses incorrectly #1628

Uh oh!

adrian-chang commented May 24, 2024 •

edited

Loading

Uh oh!

Uh oh!

vbrodsky left a comment

Uh oh!

Gabefire commented May 26, 2024

Uh oh!

Gabefire commented May 26, 2024

Uh oh!

adrian-chang commented May 27, 2024 •

edited

Loading

Uh oh!

adrian-chang commented May 27, 2024 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

[PLT-1008] Export content parses incorrectly #1628

[PLT-1008] Export content parses incorrectly #1628

Uh oh!

Conversation

adrian-chang commented May 24, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

vbrodsky left a comment

Choose a reason for hiding this comment

Uh oh!

Gabefire commented May 26, 2024

Uh oh!

Gabefire commented May 26, 2024

Uh oh!

adrian-chang commented May 27, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

adrian-chang commented May 27, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

adrian-chang commented May 24, 2024 •

edited

Loading

adrian-chang commented May 27, 2024 •

edited

Loading

adrian-chang commented May 27, 2024 •

edited

Loading